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Single top-quark production, via weak-interaction processes, is an important test of the standard 
model, potentially sensitive to new physics. However, it is becoming known that this measurement 
is much more challenging at the Tevatron than originally expected. We reexamine this process 
and suggest new methods, using shape variables, that can supplement the methods that have been 
discussed previously. In particular, by focusing on correlations and asymmetries, we can reduce 
backgrounds substantially without low acceptance for the signal. Our method also allows for a 
self-consistency check on the modeling of the backgrounds. However, at the present time, serious 
systematic problems remain, especially concerning the background from W-plus-jets; these must be 
studied further by experimentalists and theorists alike. 
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I. INTRODUCTION 



The electroweak production of single top quarks is an important standard modelproeess which the Tevatron is 
guaranteed to be able to study. This reaction, which has been investigated previously P,l3i is interesting both because 
it provides a direct measurement of the Vtb CKM element and because it is sensitive to deviations of top quark physics 
from standard model predictions, in particular, through their effects on the top-bottom- vertex Limits on single 
top production from Run I at the Tevatron have been published |^ , and the first Run II limits have appeared 
In this article we discuss methods which we hope will improve the significance of the measurement by using 
more information encoded in the shape of the signal, in a way that will be less sensitive to systematic errors than a 
simple counting experiment. However, we also show that the size of the M^-plus-jets background, and the difficulty of 
predicting it accurately, represent a serious obstacle. 

Single top production is a very unusual process. At Fermilab energies, the "i6" production of a top quark and 
bottom antiquark by an s-channel W boson, as can occur through the diagram in Fig. has a lower cross-section 




FIG. 1: Single top quark production via (a) an s-cliannel W boson; (b) a f-channel W boson. 



than "t6g"t-channel W boson of a i, b, and an extra light-quark jet near the beam axis, as occurs through diagrams 
such as that in Fig. The tbq process has a distinct shape, both because of its unusual initial state, the hard 

light-quark jet (which has large pT and large pseudo-rapidity) in the final state, and correlations between this jet and 
the lepton from the top decay. In this paper, we will explore a method for using these features to help separate single 
top from its major backgrounds: tt, W plus jets, and QCD events. 

This separation using the shape of the event is essential, because a simple counting experiment is extremely difficult 
to carry out. Both our studies and recent data indicate that the size of the relevant VF-plus-jets background is larger 
than anticipated a few years ago.P,|2 This is compounded by other issues, such as the 2% decrease in Fermilab's 
energy compared to expectations, and lower cross-section calculations for the signal We are concerned that 

systematic errors in the understanding of the background will plague a direct counting experiment at a level that 
will make any claims of discovery suspect. Thus, in our view, additional methods, independent of (but perhaps to 
be used in conjunction with) a counting experiment, are needed even for a discovery of the process, as well as for a 
precision measurement. We will argue that it is necessary, and possible, to reduce backgrounds further by using more 
information about the final states. (Note that a shape fit using a single variable was used in 0,0.) 

However, even with these improvements, our results suggest that the measurement of single top remains challenging. 
We have found that there is no simple way, even with 3 fb~^ of integrated luminosity, to achieve both good statistics 
and a satisfactory signal-to-background ratio. The essential problem, compared to earlier and more optimistic studies 
0, is that the VF-plus-jets sample with a single ^-tagged jet is larger, less predictable, and more variegated than was 
understood a few years ago. We will discuss in some detail the difficulties with this background, and the need for a 
wide range of efforts to bring it under control. 

The organization of this paper is as follows. In Section II we discuss the general structure of the events expected 
for both signal and background events. Section HI includes a detailed discussion of how we have simulated both 
types of events, the cuts used to define the event samples, the expected (and observed) differences between signal 
and background event shapes, and new observables intended to highlight these differences. In Section IV we address 
the essential issues of uncertainties, both statistical and systematic, with a special focus on the difficulties inherent 



There is, as always, some ambiguity as to whether the initial state contains a gluon which splits into a b and b as part of the scattering 
process or whether the initial state contains a b parton directly, i.e., the splitting process is part of the incoming wave function. Since 
the b jet is generally not used in the analysis below, a careful examination of this separation is not essential for us. 
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in understanding the background arising from W boson production accompanied by jets. In the final section we 
summarize our analysis and our conclusions. 

II. THE BACKGROUNDS TO SINGLE TOP PRODUCTION 

From Fig. ^ one can see that a tbq event has a final partonic state consisting of at least the following: a charged 
lepton, a neutrino, b and b quarks, and a light quark. The bottom quark that comes directly from the initial gluon 
tends to have small transverse momentum, and is rarely in the pseudo-rapidity and px range necessary to be identified 
by ^-tagging algorithms. Thus, in selecting t-channel signal events, one asks for (a) one or more 6-tagged jets, (b) 
significant missing energy (from the neutrino), (c) one and only one isolated or /j,*, (d) at least one untagged jet. 
Typically, the highest-pr untagged jet in a t-channel event is that from the light quark. Also, in typical events a t 
quark can be reconstructed from the tagged jet and a W, itself reconstructed from the charged lepton and the missing 
energy. The kinematics of the event tend to prefer a total visible scalar-summed transverse energy of order mt . 

The tb process has a b quark jet and a b jet, along with a lepton and a neutrino. In a significant number of events, 
one of the two 6-jets will not be tagged, so that the same criteria used for tbq — one 6-tagged jet, missing energy, 
a charged lepton and an untagged jet — will have moderately high efficiency for this process as well. Since the tb 
process is smaller in cross-section and considerably less distinctive in shape than tbq, and therefore harder to separate 
from background, it makes sense for us to optimize our approach for tbq. We will not, in this paper, discuss the 
measurement of the tb and tbq channels separately, as this will only be possible well after the initial measurements of 
the combined production process. 

The main backgrounds to single top production, which all can imitate the signature just described, are (1) 
"it" , top quark pair production, primarily from events where only one of the top quarks decays leptonically, but also 
from events with two leptonic decays; (2) "QCD" events with fake missing energy and with either a fake lepton or 
a lepton from a heavy flavor decay; and, most problematic, (3) 'Wj"" events with a leptonically-decaying W plus 
some number n > 2 of quark or gluon jets. 

A. Top pair production 

Top quark pair production is a formidable background to all single top channels. After both top quarks decay, there 
are two high-pT b quarks, which typically give rise to at least one 6-tagged jet, and two W bosons which can decay 
hadronically or leptonically. The signature for single top will be mimicked if only one W decays leptonically, or if both 
do and only one charged lepton is detected. The present measurement of the cross-section for tt production has a large 
uncertainty, though this uncertainty is expected to drop to around 10% [Tlj by the end of Run II. However, even this 
systematic uncertainty would prohibit observation of single top above the tt background, so a substantial amount of 
tt must be cut away. The contribution of tt with one hadronically decaying top is especially problematic, since often 
the leptonically-decaying top quark can be reconstructed. On the other hand, this background can be reduced using 
the fact that these events tend to exhibit considerably more transverse energy than true single top events, to have 
more jets, and to be more spherical. These kinematic handles are not present for events where both top quarks decay 
leptonically, but such events are suppressed by both the leptonic decay branching fraction and our requirement that 
only one or be observed in the detector. Furthermore, the accurate reconstruction of a leptonically-decaying 
top quark is more difficult in such events. 

B. QCD backgrounds 

Pure QCD events can give rise to fake leptons and provide heavy-flavor jets (or jets which have no heavy-flavor but 
are mistagged.) As fluctuations in energy measurement can lead to a fake missing Ex signal, these QCD events form 
a background to single top production. While the energetics tend to be lower than in signal events, and the invariant 
mass of the "lepton" , missing energy and 6-tagged jet exhibits no peak at rrit, the number of events is so large that it is 
not obvious, without data (or a complete detector simulation), whether this represents a relevant background. Recent 
results from DZero 7] indicate that the number of QCD events entering their single-top samples (defined by somewhat 
different cuts than used here) is smaller than for other backgrounds, and is, in first approximation, comparable in size 
to the single-top signal. This conclusion depends in detail on cuts and on the flavor of the lepton (muons being more 
prevalent than electrons in their samples) and may differ for CDF. As we will see, the methods that we describe below 
provide substantial further reduction of this QCD background and the systematic errors associated with determining 
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it, to the point that it should not pose a serious issue. Consequently, we will largely disregard the QCD contribution 
to the sample, except for a discussion in Sec. III.D as to when and why this is justified. 

C. W plus jets 

The W^-plus-jets background is much more challenging. The W^j" events potentially entering the sample consist 
of a real W boson decaying leptonically, and at least two other quarks or gluons in the final state. While the Wj" 
events do tend to have smaller energetics than single top, and do not have a reconstructible t quark, the number of 
events is so large, and the energy resolution at Fermilab is sufficiently broad, that the Wj" events form a large and 
problematic background to single-top production. 

The difficulties involved in simulating Wj"^ events have been discussed elsewhere ^i^^ '^ot give a 

full review, but it is important to note that there are special problems for the sample with a single 6-tag that neither 
untagged nor double-tagged samples suffer from. In particular, many different partonic processes, with different 
shapes, contribute to the sample in a fashion which is difficult to predict accurately. We will discuss this in detail in 
section IV. D. As we will see, reducing the systematic error on the prediction and/or measurement of this process is 
essential for success. 



III. SIMULATIONS, CUTS, SHAPES AND METHODS 
A. Simulation and Kinematic Cuts 

To model both signal and background, we have used MadEvent 14^ to generate events, Pythia to then simulate 
showering and hadronization (using the default value of the initial shower scale, V^), and PGS to act as a fast 
detector simulation. The single top s- and i-channel cross-sections have been normalized to 0.88 pb and 1.98 pb 
respectively and generated with factorization and renormalization scales /i — rrit = 175 GeV. For the process, 
the cross-section is normalized to 6.7 pb [T^ and generated with factorization and renormalization scales also at 
jjL = mt = 175 GeV. For the W^j" channel, we have limited our simulations to VF-plus-two-jets (henceforth Wjj) at 
the MadEvent level, since we believe the uncertainties in our simulations of this process are as large as the contribution 
at the next order in . Samples for the Wjj channel were generated using a renormalization and factorization scale 
/i — Mw I'^i which is selected to give the correct normalization at next-to-leading order 0|. We employ cuts at 
the MadEvent level of pTji, PTj2 > 10 GeV (where ptj. is the transverse momentum of the jet), with angular 
separation Ai?(ji, j2) > 0.4, and |77j| < 4.0. 

The 6-tagging is simulated as follows. Jets containing a h quark (either perturbatively or produced during showering) 
are taken to be tagged with an efficiency of the form 0.5 tanh (pt/36 GeV), where pt is the transverse momentum of 
the jet. Jets containing a c quark (either perturbatively or produced during showering) are taken to be tagged with a 
rate of the form 0.15 tanh (pt/42 GeV), while jets containing no heavy flavor are taken to be mistagged with a rate 
of the form 0.01 tanh (pt/80 GcV).^ 

From the events generated in this fashion, we define our initial event sample to be those events that contain one 
and only one isolated charged lepton, missing energy, one or more 6-tagged jets, and one or more untagged jets. 
These objects satisfy the "basic" cuts listed in Table The pr constraints in this and later tables apply only to the 
highest-pr ^-tagged jet and the highest-pr untagged jet. Additional cuts are necessary in order to bring backgrounds 
down to a reasonable level. 

Given that Wj" tends, compared to the signal, to have lower energy and fewer jets, and that tt tends to have 
higher energy and more jets, there are two natural choices of variables to cut on that take advantage of the overall 
kinematics of the events without appealing to event shapes. One possibility is to cut on the total transverse energy 
of the event; a second would be to cut on the number of high-p^ jets. The question of which of these (or whether a 



^ There is no agreed-upon convention for the Hght-quark/gluon mistagging function. The tanh form of the mistagging function is not 
what is used in the default PGS detector simulation, which instead is —5.54 X 10~^ -I- 1.66 X 10~'^(pt/1 GeV)^'^^. Based on the fact 
that this function leads to large mistagging rates at very high px, in contradiction to measurements at CDF Il9l . we have changed 
this mistagging function within PGS to a tanh form. The overall mistagging rate that we use is larger than assumed in other papers, 
including jj] and the recent work of Ij, where the size of VFj" backgrounds is smaller as a result. The true form of the mistagging 
function is detector- and algorithm-dependent, and different mistagging functions do change the shape of certain distributions. Our 
results suggest that uncertainties in this function lead to important, but not dominant, systematic uncertainties in the Wj" background. 
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Item 


VT 


V\\ 




> 15 GeV 


< 2 


MET {v) 


> 15 GeV 




jet (&-tag) 


> 20 GeV 


< 2 


jet (no &-tag) 


> 20 GeV 


< 3.5 



TABLE L Basic cuts for initial sample. Here px is the transverse momentum and ri is the pseudo-rapidity. 



Item 


PT 




^± 


> 15 GeV 


< 2 


MET (u) 


> 15 GeV 




jet (6-tag) 


> 20 GeV 


< 2 


jet (no fe-tag) 


> 20 GeV 


< 3.5 




Min 


Max 


Ht 


> 180 GeV 


< 250 GeV 


"mt" 


> 160 GeV 


< 190 GeV 



TABLE II: Representative "intermediate" cuts. 

combination thereof) has the lowest theoretical uncertainties has not been resolved and we will not attempt to answer 
this very important question definitively here. For this study we have chosen to cut on the quantity 

Ht = ^(pt)j + iPT)e , 

jets 

where the sum is over all jets with px > 20 GeV and |?7| < 3.5, {pT)i is the magnitude of the transverse momentum 
of the I*'' jet, {pT)i that of the lepton, and^r is the missing transverse energy in the event. Large values of Ht tend 
to favor tt final states, while lower values favor VFj" final states, with the signal contribution peaking at intermediate 
values. 

Furthermore, since the signal involves a i-quark, we also impose a requirement that the invariant mass of the 
lepton, neutrino, and the leading tagged jet be approximately equal to the top quark mass. In doing so, we must 
reconstruct the neutrino's momentum component p^,z along the beam axis, which has an ambiguity. We require that 
(pe +Pv)'^ — '^w'' ^'^'^ among the two solutions for p^,z we choose the solution with smallest absolute magnitude. (For 
complex solutions, only the real part is used.) 

As suggested earlier, we find that such cuts cannot decrease the backgrounds to the point that they are comparable 
to the signal. Two choices of "intermediate" and "hard" cuts are indicated in TableslTlland lllll The resulting numbers 
of expected events for an integrated luminosity of 3 fb^^, summing over and p,^ (and thus including both t and i), 
can be seen in Table Hvl We show the number of events which survive both the basic cuts of Table HJ the intermediate 
cuts of Table ^1 and the hard cuts of Table IIIII Consistent with 1\ , we find that while all of the cuts contribute 
to the background reduction, the Wf^ channel is reduced primarily by a combination of the stifi'er pt cuts and the 
"mt" cut, while the tt background is affected primarily by the upper Ht cut. While the basic cuts reveal a signal to 
background ratio of approximately 1:21, this improves to 1:7.4 and 1:4.9 using the intermediate and hard cuts. This 
is, at best, disappointing. 

The difference in the results between our study and that of I'l is striking, and requires an explanation.^ We believe 
the main effects can be accounted for straightforwardly. First, in the present study we have the benefit of recent 



Due to our different cuts and somewhat different approaches, a detailed comparison between the results of the two analyses is difficult. 
To obtain some quantitative sense of the differences let us focus on the ratio of signal {tbq + tb) to Wjj background, which is where 
the bulk of the difference arises and which provides an upper limit on the full signal to background ratio. For example, consider this 
ratio for the "basic cuts" results in Table lTVl vielding a ratio of 0.065. This is most usefully compared with the middle column (without 
parentheses) in Table 3 of 0, where the corresponding ratio is 0.24. The factor of nearly 4 difference results primarily from our much 
larger estimate of the Wjj contribution (larger by more than a factor of 3). Adding the mt cut in (l| increases their value for the ratio 
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Item 


PT 






> 15 GeV 


< 2 


MET (j/) 


> 15 GeV 




jet (&-tag) 


> 60 GeV 


< 2 


jet (no fe-tag) 


> 30 GeV 


< 3.5 




Min 


Max 


Ht 


> 180 GeV 


< 250 GeV 


"mt" 


> 160 GeV 


< 190 GeV 



TABLE III: Representative "hard" cuts. 



Channel 


Basic Cuts 


"Intermediate " Cuts 


"Hard" Cuts 


tbq 


298 


67 


30 


tb 


145 


27 


13 


tt 


2623 


140 


57 


Wjj 


6816 


550 


152 


{tbq + tb) 1 (tt + Wjj) 


0.047 


0.14 


0.21 



TABLE IV: Numbers of events for 3 fb ^ (summed over t and e and /i channels) for the three sets of cuts in Tables Hl jllll 

next-to-leading-order calculations Ts*!, which increase the overall rate for PFj" by of order 50 % compared to that 
used in 1]. (In our leading-order computations, this is effectively obtained through our lower choice of renormalization 
scale.) Second, we find a much larger number of tagged jets in the VFj" channel, because we include (by simulating 
parton showering) the fragmentation of leading-order partonic gluons into heavy flavor jets at subleading orders. (This 
effect would appear already in a next-to-leading-order calculation, such as performed recently in 9J.) Third, our more 
pessimistic estimate of energy resolution at the Fermilab detectors forces us to use a wider nit window cut in order 
to have sufficient acceptance for the signal; this lets in more VFj" background. Fourth, we use a more pessimistic 
^-tagging rate (50% vs 60%) and light-quark-jet mistagging rate (1.0% vs 0.5%), a more pessimistic charm-to-bottom 
ratio in tagging (1:3.3 vs 1:4.0), and more realistic pt distributions in tagging functions; these all hurt the signal-to- 
background ratio and the efficiency for the signal. Other small negative effects include a lower center of mass energy 
(1.96 TeV vs. 2.00 TeV), and a lower cross-section for the signal (due largely to a change in parton distribution 
bmctions.) Note also that we have used mt = 175 GeV, so our results may even be slightly optimistic in this regard. 

The numbers in Table IIVI suggest that a further factor of 3-5 improvement in the signal-to-background ratio via 
more aggressive cuts will come at the cost of a factor of 5-10 reduction in the signal accompanied by a factor of 15-45 
reduction in the background. The essential question then is whether a reduction by such a large factor can be achieved 
without large systematic and theoretical errors. From Table HVI one can see that systematic errors below about 10% 
in W^j" are needed for a discovery. Unfortunately, the method for removing the background suggested in [J , namely, 
to use a jet veto to reduce tt to a small contribution, and do a sideband analysis on either side of the mt window 
cut to remove VFj", is unlikely to work with such a large Wj" background. This is illustrated in Fig.|5| With basic 
cuts, the H^j" background (whose "mt" distribution falls steeply and monotonically above 125 GeV) is very large, as 
shown in Fig. [2^. A sideband analysis with a window centered around 175 GeV would be subject to large statistical 
errors. Meanwhile, the intermediate and hard cuts, shown in Figs. [2b and|2t, tend to shape the VFj" events such 
that the Wj" background is no longer monotonic near and/or across the mt window, making a sideband analysis 
problematic. Thus the shape of the "m^" distribution of the Wj^ background after hard cuts must be predicted, with 
small errors. We will argue later that this is very difficult. Consequently, we doubt that a straightforward counting 
experiment can yield, on its own, a satisfactory measurement of the cross-section for the production of single top. 



to 0.77. However, with our much larger Wjj contribution, we can improve this ratio to only 0.28 when using the "hard" cuts that 
include a cut on mt, i.e., the factor of 3 larger Wjj background is still there. The jet veto used in 1] (the last column in their Table 
3) only reduces the background from tt (and also the signal), and does not help reduce the Wjj background. 
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a) Basic Cuts 
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FIG. 2: Distribution of the reconstructed "mt", the invariant mass of the charged lepton, neutrino and highest-pT 6-tagged jet, 
for the backgrounds and signals. Additional details concerning the construction of this observable are given in the text. The 
three figures show the results for the (a) basic, (b) intermediate and (c) hard cuts given in Tables ITllllI with the cut on "mt" 
omitted. 



B. Shape Variables 

Under the assumption that a counting experiment is insufficient, we turn to observables that (as in ^|^) make 
use of other aspects of the signal. In particular, we will now explore variables that take advantage of the very special 
shape of single top production compared to the background, and are less subject to, or less sensitive to, systematic 
errors. 

As noted earlier, the dominant production process is tbq, in which there is a hard lepton, and also a hard untagged 
jet j with pseudo-rapidity I77I ~ 1 — 3 and pt typically larger than 25 GeV. This strongly forward or backward jet is a 
distinctive signature which the backgrounds do not share. The b jet from the t decay tends to be produced centrally 
(low pseudo-rapidity) with high-pT, and is typically the tagged jet (for which reason we only use the tagged jet in the 
t reconstruction.) The other b jet tends to have low px; it is often unobserved, and is rarely tagged 0. 

Importantly, the tbq process arises from an initial state light quark or antiquark (typically a valence quark carrying 
moderate to large Bjorken-a;) scattering off a gluon (typically carrying lower x.) This unusual initial state has 
important kinematic consequences. Because of this kinematic effect, the structure of the proton, and the details of 
the electroweak theory, the tbq signal has strong and distinctive correlations and asymmetries which we can use to 
separate it from the backgrounds. 

First, the unusual kinematics and the structure of the proton combine in an interesting way. The creation of 
positively-charged top quarks (and consequently positively-charged leptons in the final state) in the tbq process 
requires either a u or c? in the initial state, so that a can be emitted from the quark line. Since a reasonably large 
value of Bjorken-x is necessary in order to produce the top quark, the required initial state is most often obtained 
from a valence quark striking a gluon. The most likely initial state uses a valence u quark from the p; the next most 
likely draws a valence d from the p, and thereafter we must draw on the sea quarks from either p or p. Since the quark 
usually has larger x than the gluon that it strikes, the tbq system is typically boosted in the direction of travel of the 
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initial quark. Consequently, for t quarks, the tbq system is more likely to be boosted in the proton direction than in 
that of the antiproton, by a factor of roughly 2:1. The reverse is true for t production. Moreover, the light-quark jet 
in the final state, converted from the quark in the initial state by the emission of the W, tends to travel in the proton 
direction when a t is produced and in the antiproton direction when a f is produced. 

Thus, because of the differences between the u and d parton distribution functions in the proton, and because of the 
quark-gluon initial state, large asymmetries under parity P and charge conjugation C result. These show up strongly 
in the differential cross-section both for the top (and the positively charged lepton in its decays) and for the hard 
forward or backward light-quark jet. 

Second, the momentum vectors of the lepton and the light-quark jet are correlated, as a result of both kinematics 
and spin polarization effects. The structure of the electroweak interactions ensures that the spin of the top quark 
tends to align with the momentum direction of the light-quark jet. Since the top decays before this spin information 
is lost, it is transferred to the momentum of the final-state charged lepton. In the t rest frame the lepton momentum 
and jet momentum tend to align 0|. The boost of the top quark relative to the lab frame, whose sign is also aligned 
with the momentum direction of the jet, further tends to push both lepton and jet into the same hemisphere. 

These properties strongly distinguish the tbq process from its tt background. At tree-level, tt is separately C and P 
invariant. Clearly this is true of the process gg — > tt, since the initial state is P-invariant on average. It is also true of 
the process qq ^ g ^ ti, because the intermediate gluon state is a C-eigenstate; this is the same as in e^e^ — > fi^ , 
where there is no forward-backward asymmetry in the /i"'' distribution. The parity-invariance is violated at the next 
order, due to radiative effects |23|; this is a few percent effect, both small and calculable. Moreover, there should 
be no strong correlation between the momenta of the lepton and the jets in the final state. In those ti events which 
have both low Hj- and a reconstructable semileptonically-decaying top quark, one or two jets from the hadronically- 
decaying top tend to be lost or mismeasured. Meanwhile any high-px untagged jets whose momenta we might choose 
to compare with that of the lepton will also stem from the hadronically-decaying top. The accidents which lead to 
the selection of any given jet as part of our analysis should largely wash out any correlation of its momentum with 
that of the lepton. Indeed, our simulations show that in ti the correlation between the lepton and the highest-pT 
untagged jet (which we will use in our analysis below) is roughly a 10% effect. 

Similar considerations apply, to a good approximation, to those QCD events which might pass our cuts. Parity 
asymmetries for these events are small. The charge of a fake leptons is unrelated to its momentum direction. Con- 
sequently, all distributions for events with fake leptons are invariant under flipping the sign of the lepton charge; as 
we will see, this imples that these events have P-invariant distributions for the observables we will choose. Isolated 
leptons from heavy flavor stem mainly from cc and hh events; these have similar C and P properties to tt, so we expect 
small parity-asymmetries. Meanwhile, a fake lepton and the highest-pT untagged jet in the event should be essen- 
tially uncorrelated. However, if this jet contains heavy flavor, then a correlation can arise when the lepton observed 
in the event is from a wide-angle semileptonic decay of a heavy-quark within the jet. The precise size of jet-lepton 
correlations from this source is unknown to us, and is detector- and cut-dependent. The small overall number of QCD 
events entering the single top samples at DZero suggests that QCD contributions to jet-lepton correlations are not 
a major issue for the single top measurement, except possibly in the case of muons from heavy flavor decays. We will 
return to this possible exception in section III.D. 

The parity asymmetry in W j" unfortunately has the same sign as that of thq, although it is less pronounced. The 
reasons for this are easy to see. As with a t quark, a is most likely to be produced moving in the proton direction, 
since it is most often produced in a ud event. This leads to a well-known asymmetry in its pseudo-rapidity. When 
produced in conjunction with two jets, the still tends to be boosted in the proton direction, since its initial state is 
most often ug or ud. This leads to parity asymmetries which, though relatively small, are still quite large in absolute 
size compared to the signal.^ Unfortunately, the size of the asymmetries and correlations in W j"^ appears to be very 
sensitive to assumptions, cuts, Monte Carlo parameters, and tagging, and will be a source of significant systematic 
error. We will return to this issue later. 



C. Consequences of Parity and Correlations 

In order to make the best use of these special properties of the signals and backgrounds, it is useful to consider 
these issues more formally. We next discuss the effect of C and P (non)-invariance, and of lepton-jet pseudo-rapidity 



* Note these asymmetries in pseudo-rapidity, or equivalently angle, for fixed charge, are due to the Tevatron's proton-antiproton initial 
state. At the LHC, with a proton-proton initial state, the same effects will show up as charge asymmetries for fixed angle. 
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correlations, on two-dimensional distributions in pseudo-rapidity. The "jet" used throughout the analysis below is 
always the highest-px untagged jet in the event. 

The pp initial state of the Tevatron is a CP eigenstate, and so all distributions of final state particles are CP- 
invariant (to an excellent approximation, violated principally by the detector) . This means that the differential cross- 
section (Pa'^ / drjjdrji with respect to the rapidities of the jet and the positively charged lepton, and the corresponding 
distribution d^a^ / drjjdrii for processes with a negatively charged lepton, must be related by CP: 



drjjdrji ' dr/jdrje 

Consequently, we can combine data from positively and negatively charged leptons by defining a lepton- charge-weighted 
pseudo-rapidity, fij = Qirjj, fji = Qirji, where Qi is the lepton charge. (The variable fij was already introduced in 
For the remainder of this article, our entire discussion is based on the explicitly CP-invariant differential cross 

section 

Ji^k^^''^'^ " H'^'"' ^ ''''''' = + ^ ^ ■ 

However, the pp initial state is not an eigenstate of either C or P. Consequently, in general we expect parity-non- 
invariance 

dV+ dV+ 

and similar non-invariance under charge conjugation 

d^a+ d^a- 
drijdrji dr^jdrji 

(Indeed these two statements are equivalent due to CP-invariance.) In terms of the combined differential cross-section, 
P (and C) non-invariance implies 

-{m^m) t ,~ [-Vj,-m) ■ 



dfjjdfje dijjdije 

Conversely, if we were to study a parity- symmetric process, such as the tree-level production of tt, it would satisfy 

d^cr ^ d^a , ^ ^ . , . , 

,^ (Vj,Ve) = {-Vj^-m) (parity - even process) . 

arjjdTji arjjdrji 

Next, let us consider the effect of correlations. If the dynamics of a process is such that the jet and lepton directions 
are uncorrelated, then the differential cross-section factorizes into a product of two distributions, one for the jet and 
one for the lepton: 

d^fj 

— = f{'nj)g{f}i) (jet and lepton uncorrelated) . 
drjj drji 

Failure of this relation is proof of correlations. These might stem directly from correlations in the rest frame of the 
thq system. However, even if the distributions in the rest frame are uncorrelated, they will be correlated in the lab 
frame, once they are convolved with a distribution of boosts of the rest frame. ^ 



^ For example, suppose = f{Vj)9ifie) in the tbq rest frame, with Gaussian lepton and jet distributions: 

Suppose further that the rest frame is boosted by an amount r]i, with a propability which also has a Gaussian distribution 

p(»?t,) oc e-^^'i 

Then the observed distribution in the lab frame is 

oc 

dcr f N \ f \ -\{B+At)Ajfj^ + (B+Aj)Atfjj-AjA),fjjfie]/(Ai+At+B) 

— — OC / dr]t f{r]j - r?b) g (r/i, - p(%) oc e " :i j y jj i i j t j ij/y j j ^ 

drij drii J 

— oo 

This is a correlated distribution; it cannot be written as F{rij)G{r]i), because of the cross-term in the exponent. Note the correlation 
vanishes in the limit of a very narrow distribution of boosts, B — > oo, even if the boost distribution is not centered at zero. 
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Item 


PT 


\rj\ 


t 


> 15 GeV 


< 2 


MET {y) 


> 15 GeV 




jet (&-tag) 


> 40 GeV 


< 2 


jet (no fe-tag) 


> 30 GeV 


< 3.5 




Min 


Max 


Ht 


none 


< 300 GeV 


"mt" 


> 155 GeV 


< 200 GeV 



TABLE V; "Relaxed" cuts for analysis. 



Finally, if the process is both uncorrelated and P-invariant, then both distributions must be even functions of their 
particle's pseudo-rapidity. In short 

,^ = fiflj)9{iu) ; fiVj) = fi-Vj) ; gim) = gi-m) (uncorrelated, parity - even) . 
drijdr]i 

with the four-way consequence 

d^a ,^ ^ d^a ^ ^ d^a ^ d^a , ^ ^ . 

for such processes.^ The tt process does indeed satisfy the relation Q at the 90% level. We believe this continues 
to next-to-leading order: one-loop effects cause a 5% parity asymmetry in t and t production angles '21*1, which, 
when translated into jet and lepton pseudo-rapidities, is unlikely to violate parity by more than 10% (though this 
has not been simulated for our specific cuts.) We believe that QCD contributions to the sample are similarly in good 
agreement with except possibly for lepton-jet correlations in events with heavy flavor. The Wj"" background, 
with its moderate asymmetries and correlations, accords with ^ only to a very rough approximation. And as we 
have emphasized, the tbq signal strongly violates the relation 



D. Strategy and Tactics 



To illustrate the characteristic properties of the various channels, we will study a simulated event sample defined by 
the cuts in Table Ivl These cuts are more "relaxed" than those of Tables Ull and IIIll keeping a larger fraction of both 
the signal and the background, and yielding a sample which we believe is less sensitive to the systematic uncertainties 
stemming from the cuts. We will argue below that shape considerations will allow us to make some headway toward 
separating signal and background. The simulated differences in shape are summarized in the contour plots of Fig. |31 
which give the distributions {d'^a/df/jdrj^) of the various processes, plotted as functions on the (?7j,?7£) plane. (Recall 
that we define the "jet" of relevance to be the highest-pT untagged jet.) 

Figure illustrates the degree to which the various processes exhibit correlations and asymmetries. We label the 
four quadrants of the (?7j, ?7i) plane A, B, C, D as shown in Fig.0] Positive jet-lepton correlations cause events to pile 
up in quadrants B and C, while parity asymmetries appear in the differences between quadrants B and C, and between 
quadrants A and D. The tbq signal in Fig.|2l3 shows clearly shows both effects. The M^j" background, in Fig.|31i, also 
has an asymmetric shape, though to a lesser relative degree. The tb process. Fig. shows some correlation but no 
asymmetry, while Fig. (S); illustrates the uncorrelated and symmetric nature of (tree-level) tt. 

As a quantitative measure of these statements, we consider the differential cross-sections integrated separately over 
the four quadrants of the (77^, f)^) plane. For a given luminosity £, the number of tbq events in the A quadrant is 

£ X / dr]e drjj , 

Jo J-3.5 dr].jdr]e. 



Note the logic is not reversible; a distribution satisfying the condition is not necessarily uncorrelated. 
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a) tb 



b) tbq 




c)tt 



-3 -2-10 1 





0.00 pb 




0.001 pb 




0.002 pb 




0.005 pb 




0.01 pb 




0.02 pb 




d)Wjj 




-2-1012 
Ijet 



FIG. 3: Differential cross-section id? a /dfjjdfji, summed over t and t, e and /i) for the a) th channel b) tbq channel, c) tt channel, 
and d) W jj channel, after 6-tagging and the cuts of Table IVI 



A 


B 


C 


D 



FIG. 4: The four quadrants of the plane. 



Channel 


A 


B 


C 


D 


th 


8 


11 


12 


8 


tbq 


14 


41 


18 


22 


tt 


105 


109 


106 


105 


Wjj 


204 


207 


159 


180 



TABLE VI: Numbers of events for 3 fb ^ in the four quadrants for various channels (summed over t and t, e and n). See Fig.0] 
for definition and labels of the quadrants. 
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that in the B quadrant is 



£ X 



dfje 







3.5 ^ ^2^*69 



and so forth. The resulting numbers of events with £ = 3 fb~^ of data appear in Table fVTI (summed over t and i, 
e and ji). Statistical uncertainties in each bin are uncorrelated with other bins. Systematic uncertainties in these 
numbers, which are very substantial for Wj", deserve considerable discussion; since this table is merely intended for 
a general illustration, we defer this discussion until Sec. IV. 

To capture quantitatively these differences in shape between signal and background, we suggest defining three 
orthogonal functions in the (7)^^7)1) plane, based on the formal discussion of the previous section. For any differential 
cross-section, whether a signal, a background or a combination, and whether simulated or measured experimentally, 
we may write it as a sum of three components 



dfjjdfje 



where the components are of the form 



dfjjdfji 



d^a 
dfijdfii 



d^a 
dfjjdfji 



{flj,-f/{) + 



_fa_ 

df]jdf]t 



(2) 



(3) 



dfjjdfji 



dfjj dfji 



d^a 
dfjjdfj. 



d^a 
dfjjdfji 



(4) 



F-ivj^m) 



d^a 
dfjj dfji 



d^a 
dfjjdfji 



i-flj,-fl£) 



(5) 



Let us comment on some properties of these functions. First, they are explicit functions, not abstract devices: they 
can be directly constructed from any finite data set, simulated or measured. Second, they are orthogonal in the sense 
that 



dfjj / drjf FF± = 0; 



dfjj / dfje F+F^=0; 



(where a and b are arbitrary positive numbers); indeed this orthogonality applies in any symmetrically-defined region 
of the {fjj,fje) plane. Third, the functions provide important physical information about the symmetry properties of 
the differential cross-section. F and F+ are parity-even while F_ is parity-odd; thus F^ = (within statistics) for any 
parity-even distribution. Meanwhile, because of Eq. F^ will also vanish if the distribution is parity-even and the 
leptons and jets are uncorrelated. Fourth, by construction, these functions have special symmetries under reflection 
in the {fjj,fje) plane, as will be obvious in the figures below. F and F^ have four- way symmetry; in both cases, it 
is sufficient to know the function in any one quadrant to know it in all four quadrants. Meanwhile F„ has two-way 
symmetry; quadrants A and D are related, as are B and C, but quadrants A and B are independent and must be 
determined separately.^ 

In the problem at hand, the fact that the signal has strong asymmetries and correlations, while the backgrounds 
do not, is very usefully characterized using these functions. In particular, we expect, based on the properties we have 
discussed above, that 



F*'"^ ~ IF 



tbq. 



Our simulations further suggest that one can find cuts that are feasible at the Tevatron such that the backgrounds 
are still very large but only in F, with 



F" ^ F^J" ^ F""^ 



In short, F— contains twice as much information as and F. In principle one could further separate F— into two orthogonal functions, 
but this turns out not to be particularly useful. 
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while the signal has a much larger role to play in the other functions: 

especially away from the center of the {f}jTfli) plane. (The tb signal is smaller than the thq signal for all quantities, 
but especially for F+, by a factor of about 3, and for F_, by a factor of order 10.) 



a) tb b) tbq 





1 - 













-1 - 






-3-2-10123 

^m. 0.001 

0.002 
0.005 
0.01 

C) tt ^ 0.02 


-3-2-10123 
3b Ijet 

pb 

pb 

pb 
Db 

d) Wjj 




V 


1 

C 
O 

S. 






_ai 

-1 





-3-2-10123 -3-2-10123 
1j.t Ije, 



FIG. 5: Contour plots for the function F{f}j,f]i) for a) tb, b) tbq, c) tt, and d) Wjj channels (summed over t and e and /i). 

These claims are illustrated in Figs. El and [3 where the functions F,F^,F^ are shown, for tb, tbq, tt, and Wj^. 
(The reader should note that the scale for the contours in Figs. Eland [7| differs from that used in Figs. O and El this 
is because of the smaller dynamic range in the F± distributions.) The symmetry properties of the three functions 
discussed earlier are clearly evident. 

To make this comparison more concrete, the integrals of the 3 functions over quadrants A and B, for an integrated 
luminosity C — 3 fb~^, are presented in Table IVIll this table can be constructed from Table IVTl The definitions of 
the entries in the table are 

Fa = £ X df,e F = Fb =Fc = Fo, 

F+^A = £ X dfji, F+ ^ -F+^B = -F+^c = F+^o , 

F_^A = £ X Jj^drij dfii F_ = -F^,u , 
F_,B = £ X j^dfjj dfji F_ = -F+,c. 

The table also shows the statistical errors in these quantities. Since this table is intended only to emphasize qualitative 
points, we postpone discussion of systematic errors and next-to- leading order corrections until we outline a more 
sophisticated approach with better statistical errors. 

As we argued before, we are justified in disregarding QCD events. The number of QCD events entering the sample 
is small 0, 

pti^pWj" ^ pQCD ^ ptbq^ptb _ 
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FIG. 6: Contour plots for the function F+ {f]j,'i)i) for a) th, b) tbq, c) tt, and d) Wjj channels (summed over t and t, e and ^). 



Channel 


Fa = Fb 


F+,B = -F+,A 


F-,A 


F-,B 


tb 


9.8 ± 1.6 


1.4 ± 1.6 


0.0 ± 2.1 


-0.3 ± 2.4 


tbq 


23.8 ± 2.4 


5.6 ± 2.4 


-3.7 ± 3.0 


11.6 ± 3.8 


tt 


106.1 ± 5.2 


1.2 ± 5.2 


-0.2 ± 7.2 


1.3 ± 7.3 


Wjj 


187.7 ± 6.9 


-4.6 ± 6.9 


11.8 ± 9.8 


23.8 ± 9.6 



TABLE VII: Numbers of events for 3 fb ^ in F, F-f and F_ , integrated over quadrants A and B, for the various channels 
(summed over t and t, e and jj,). Errors shown are statistical only. See text for further interpretation. 

For the DZero detector, the number of events in the electron channel is smaller than the number in the muon 
channel. We expect lepton-jet correlations and parity asymmetries to be extremely small for fake leptons; for isolated 
leptons from heavy flavor, parity asymmetries are very small at tree-level, and small at higher-orders, while lepton-jet 
correlations might be a bit larger. Thus 

pQCD ^ \pQCD^ ^ \pQCD^ ^ 
even for isolated leptons. This leads us to expect 



except possibly for in the muon channel. The dominant source for will be from muons emitted at large angles 
during bb and cc events. The kinematics of these events can be studied using a double-tagged sample. We believe, 
therefore, that even if the contribution of QCD muon events to F+ is large enough to be a concern, its size and shape 
can be determined from the data. 
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E. Lessons and Caveats 



We can now extract some important lessons from Table rvTll Before doing so, we should comment on its limitations. 

First, our study is done entirely using tree-level short-distance matrix elements; only the normalizations are at 
next-to-leading order. We must therefore emphasize that the figures and tables in this paper are meant for illustration 
only. Next-to-leading-order corrections to the matrix elements will change the shapes of the background, in ways 
which contribute nontrivially to Table IVIll For example, we expect the above-mentioned next-to-leading-order effects 
on tt to affect our estimates of F", by something of order 10% of F", which is not negligible. Such changes are 
large enough that they must be calculated and/or measured, but are small enough that they do not invalidate the 
methodology we are outlining here. 

Second, it is clear from the table that statistical errors from the background are large in and F_, comparable to 
the signal. While this looks discouraging, it applies for the distributions integrated over the entire A or B quadrant. 
A quick examination of Fig. |21 shows that the situation is not quite as bad as it appears, if one considers the region 
away from the center of the (%,??^) plane, where the backgrounds are much smaller and the signals are still quite 
large. We will discuss this in much more detail in the next section, where we will do a more sophisticated analysis, 
but for the moment we simply note that the size of the statistical errors is misleadingly large in the above table. Still, 
we will see that the situation with statistics remains unsatisfactory. 

With these caveats in mind, we return to Table IVTll on the basis of which we suggest the following general approach. 
One should first construct, for both the data and the Monte Carlo simulation output, the F, and F_ functions. 
Using these functions, as well as information obtained from other measurements, one can systematically test one's 
understanding of each process. The F function allows a measurement of the sum of the backgrounds without much 
contamination from signal. We assume that the separation of Wf^ from tt can be obtained using the fact that 
the ti process can be measured and predicted with reasonable accuracy, using other data samples and Monte Carlo 
simulation. One can then cross-check one's understanding of the shape of the Wj^ background using the part of the 
F_ distribution (located roughly in quadrant A) where the signal is negligible. Finally, one can attempt to measure 
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the signal from and from a different part (located largely in quadrant B) of the F_ distribution.^ Effects on 
from QCD events with isolated leptons can be cross-checked by study of double-tagged events, and by comparing 
for electrons against F+ for muons. 

We now proceed to refine this approach, and to estimate the associated uncertainties. 



IV. UNCERTAINTIES AND OPTIMIZATION 



Our goal in this section is to show that the measurement of the signal using F+ and F_ is potentially feasible, 
though difficult. We begin this section with an overview that, using Figs. |H1 and El as guides, lays out our main points. 
We then turn to more detailed consideration of statistical and systematic uncertainties, especially those associated 
with W^-plus-jets. 



A. Overview 



FigurelHlshows the signal-to-background ratio for the differential cross-sections and for the three orthogonal functions 
that we have defined. Figure IHb shows the ratio 



tb 



ptbq _|_ p 



ptt pWj] 



according to our simulations. Nowhere in the plane is the signal-to-background ratio of order unity, implying that 
sensitivity to even small systematic errors is severe. Instead, F should be viewed as insensitive to the signal, and 
therefore mainly useful in helping determine of the size of the backgrounds. 
Figures IHt and|Sli show the analogous ratios 



ptbq ^ ptb 



F" + F_ 



and 



ptbq _|_ p 



tb 



F" + F_ 

(We use absolute values for these two functions, as both numerator and denominator can be negative.) For both F^ 
and the signal-to-background ratio reaches unity in some parts of the plane, implying that, for sufficiently high 
statistics, the signal can be measured in these two observables even if the background has relatively large systematic 
errors. In both cases, the dark regions are the best ones for the measurement; the other regions should be cut away. 

In Fig.|51the ratio of signal to the square-root of background-plus-signal is plotted, for a integrated luminosity of 3 
fb^^. Figure shows 



ptbq _|_ p 



tb 



( F" + F^ii + F*'"i + F^^) 



1/2 



while Figs.int andlHtl show 



and 



\P^bq^ptb^ 
1 ( ptt _|_ pW]] ptbq _|_ ptb'^ 



<ptbq _|_ ptb^ 



J-fptt _L. pWj] I ptbq I ptb I ptt I pWjj , ptbq ptb 

\ + + + + 



1/2 



Note that the study in ^ measures half this information; it is able to measure F and the difference of quadrants A and B in F— 
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FIG. 8: Signal-to-background ratio S/B across the {i)j,r)i) plane, showing, within each 1.16 x 1.0 bin in rjj x rji, the ratio of 
signal to background for a) j d^a/drjjdrji, b) F, c) \F+\ and d) 

(Note that, by construction, F > F+ for any distribution, so the expression under the square-root is always positive.) 
The form of the denominators (and the factors of 1/2 and l/-\/2) follow from ordinary Gaussian statistics for each 
bin, using the definitions of F and F±, Eqs. ©^lEl, as linear combinations of statistically independent bins.^ The 
possibility of aggregating bins into larger regions, in which these ratios are of order 1.2, is illustrated using the dot-dash 
outlined bins in the case of F+ and F_ . 

Figures |H1 and |51 show that there is nonzero overlap, for both and _F_ , between the region where the statistics is 
best and the region where sensitivity to systematic errors is small. Moreover, we learn that the two measurements 
should use data predominantly from certain "windows" in the {f]j,f}() plane. The region where the jet has small \f}j\ 
has severe problems with statistical background and sensitivity to systematics. The measurement should be made, 
roughly speaking, by using all data for fjj > 1 and any fji, (recall that the regions with either or both fj negative 
are redundant in the case of F+.) Meanwhile the F_ measurement of the signal should be made, roughly, by cutting 
away all but the region fjj > 1 and fji > —1 (the mirror-symmetric region fjj < — 1 and "fji < 1 being redundant in this 
case.)^'^ 



For instance, F_, defined in Eq. jSJ, satisfies 



1/2 



Since the two terms in the right-hand side are uncorrelated. 



V dfjj drje 



— (Vj,Vl) + 



V df]j dfii ^ ' ' 



where we used Eqs. and jlj. This explains the formula used in Fig. l9li. 

^'^ We do not mean to imply that we are specifying the precise form of these windows. Their shapes must be optimized for each particular 
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FIG. 9: Statistical significance S/ \/B + S, for an integrated luminosity of 3 fb~^, across the plane. Within each 1.16x 1.0 

bin in rjj x rji , the number of events in a) J d^a/ drjj drje , b) F, c) | F+ \ and d) | F_ | , divided by the appropriate square- root of the 

number of events in background plus signal (see text for formulas.) For each box outlined in dot-dashed lines, the accompanying 
number indicates the improved value of S/ \/ B + S that results when the bins inside the box are combined. 

Let us also call attention to the region ffj < 1 and fji > 1 for (ignoring the redundant mirror-symmetric region) , 
away from the center of the plane but also outside the above-mentioned F_ window. Here, the signal is very small. 
This region is useful for a check of the modeling of the Wj^ background. In the standard model, the contribution of 
F_ should be consistent with the Wj" background, with little tt and QCD background, and no measurable signal. 
Verifying that this is the case is both a cross-check on the analysis and a test of the standard model, so the measurement 
of F^ in this region is also very important. 



B. Statistical Errors 



To explore the size of statistical errors, we have carried out two unsophisticated likelihood analyses, one optimistic, 
one overly pessimistic. In both cases, we assume 3 fb~^ of integrated luminosity. We assume that the shapes of 
the backgrounds and signals are known, and we fit for their normalizations. This is not quite appropriate, since 
some information about the normalization of the backgrounds will be known from other sources, and since there are 
significant uncertainties in the shape functions, especially for Wj", but we believe the measure we obtain is not far 
from the truth, and that the correct lesson can be extracted. 

In the first likelihood analysis, somewhat optimistic, it is assumed that the shape of the distribution, in the (?7j, fji) 
plane, of the sum of all backgrounds, tt plus Wj" (plus QCD) is known. Similarly, it is assumed that the shape of the 



analysis, based on the associated backgrounds, acceptances, detector issues, and integrated luminosity, as well as improved simulations. 
Also, no window cut need actually be used; a fit or neural network will naturally weight the data inside the windows heavily, while 
down- weighting the regions outside the windows. 
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signal, tbq plus tb, is known. Maximizing the likelihood as a function of the relative normalization of the signal over 
background, we find that the measurement of signal over background can be made to a precision of about 40 percent. 

For the second likelihood analysis, it is assumed that the shape of tt (plus QCD) is known, the shape of Wj"' 
is separately known, and, as before, the shape of the signal tbq plus tb is known. We then fit for the two relative 
normalizations, and find that the measurement of the signal over the background can be made to a precision of about 
50 percent. Meanwhile the measurement of the relative normalizations of the two backgrounds can be performed with 
an uncertainty of order 15 percent. 

Both of these analyses show that statistical uncertainties are large even for £ = 3 fb^^, probably comparable to 
or larger than the theoretical uncertainties in the shapes of the distributions for the signals and backgrounds. To the 
extent that normalizations of the backgrounds can be pinned down using other information, the situation may be 
slightly better than this estimate suggests. 

Intuitively, from the figures, the region in the center of the (?7j , fji) plane plays a large role in fixing the backgrounds, 
while the large-^j region of quadrant B plays a large role in fixing the signal. Indeed, as noted earlier, the statistical 
significance of the measurement in the signal comes dominantly from the regions outlined in dot-dashed lines in Fig.lHl 
Both measurements of F+ and F- in these "windows" have S/\/B + S of order 1.2, reasonably consistent with the 
above likelihood analyses, which were applied to the full distributions over the entire plane. 

C. Systematic Errors: General Comments 

We now turn our focus to systematic uncertainties, which mainly stem from an inability to predict and simulate the 
backgrounds. Our use of the and F^ functions, we will argue, helps us reduce the sensitivity of the measurement 
to systematic uncertainties. However, the systematic errors on the backgrounds are very large at present, and must 
still be reduced. 

Earlier in this section, using Fig. [SJ we discussed the signal-to-background ratios for the various component 
functions. We noted that Fig. IHt indicates that F has a poor signal-to-background ratio. Although it has been 
argued that systematic uncertainties in predicting F, using a combination of Monte Carlo simulations and data, will 
not be large, even a 15 percent systematic uncertainty is already enough to make F problematic for measuring the 
signal, no matter how good the statistics. Instead, it is better to treat F as a measurement which, along with other 
inputs, is used to help fix the normalizations of the backgrounds.^^ 

Fig-IHt andlHtl show that the situation with F± is much better, outside of the central region of the plane where the 
background peaks. -F+ may receive some additional contributions which we have neglected. There is a QCD contribu- 
tion, which we have argued is probably small, especially in the electron channel. Detector effects and correlations from 
cuts can also contribute to F+. This means the signal-to-background ratio in the figure may be an overestimate.^^ 

Meanwhile, F_, the P-odd C-odd observable, has the feature that many potential sources of systematic uncertainty 
largely cancel. We expect QCD backgrounds, correlations from cuts, and effects of detector cracks or damage to be 
small in this variable, since they are largely P-even and/or C-even. Many uncertainties in simulating tt and even, to 
a degree, Wj"^ will have substantial cancellations. This makes this variable especially compelling. 

Eventually, the contribution of tt to F± should not have large systematic uncertainties. It is important to keep 
in mind that there are non-negligible effects, including a parity asymmetry |21| |. that appear at one loop. Next-to- 
leading-order calculations of the distributions in the (Tyj,^^) plane are needed. These can be computed with small 
errors and should allow a precise background subtraction of ti. At present, however, their absence causes a substantial 
uncertainty in this subtraction. 

For both F^ and F_, the most serious systematic problems stem from the uncertainties in the shape of Wj^. For 
this reason, we will now present an extended discussion of this background. 



In the counting experiment discussed in Sec. III. A, one essentially uses F to make the measurement, though with much tighter cuts 
than the "relaxed cuts" employed here. 

Moreover, it appears in the figure to be somewhat better than we actually expect it to be, perhaps by as much as a factor of two 
in any given bin. In our simulations there is some accidental cancellation between the Wj" and tt contributions to see Fig. ISl 
Unfortunately our knowledge of these two backgrounds is too poor to be certain that this cancellation is robust, and moreover statistical 
fluctuations may also ruin the cancellation. 
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Wjj Channel 


(T(before tags) [fb] 


(7(after tags) [fb] 


Fraction tagged 


Wqq 


16, 470 


192 


1% 


Wan 




732 


2% 


Wqq 


14, 760 


484 


3% 


Wcq 


3200 


318 


10% 


Wcg 


2240 


238 


11% 


Wcc 


600 


104 


17% 


Whb 


496 


224 


45% 


Total 


69766 


2291 


3% 



TABLE VIII: The cross sections for W events with at least two jets, before and after tagging of one of the jets. Each row 
refers to the combination of processes with the tree-level final state shown in the left-most column. The Wcq and Wcg entries 
include both c and c quarks; the symbol q stands for u, d, s, u, d, s. After showering, hadronization and jet identification, and 
application of the cuts in Table |I] the resulting cross section is indicated in the next column. The cross section corresponding 
to a single tagged jet is shown in the third column, while the last column shows the ratio of the previous two. Simulation 
uncertainties in these numbers are of order ±1-3%, except for the Wqg and Wqq channels, with uncertainties of order ±4% 
and ±8%, respectively. The (much larger) systematic uncertainties are discussed in the text. 

D. Systematic Errors: Predicting I¥-plus-jets 

One might hope that systematic errors stemming from the Wj^ background could be greatly reduced, for the 
variables F±, using a sideband subtraction in the variable "mt". This approach, analogous to the method suggested 
in for use in a counting experiment, would allow a background subtraction without much theoretical input. 
However, any such attempt will run into the same issues that cause this method to fail for a counting experiment. In 
Fig. 13 we showed that the application of aggressive cuts to bring Wj^ under statistical control simultaneously makes 
a sideband analysis problematic by deforming the shape of the Wj^ background, leaving it non-monotonic across the 
region needed for the sideband analysis. To do a subtraction therefore requires that the shape of the background, 
after cuts, to be accurately predicted^ using a combination of theory, Monte Carlo and data. Unfortunately, prediction 
of any aspects of W^j", especially with one or more ^-tagged jets, is very difficult indeed. As we will now discuss in 
detail, we believe that Monte Carlo results for the Wj^ sample with 6-tags cannot currently be trusted at the level 
that is likely to be needed. 

The shape of the H^j" background is plagued with the usual concerns about the inability of Pythia or Herwig 
to reliably generate the correct pattern of additional radiated jets; for recent discussion, see |23- But to this and 
other typical problems, which are known to be issues in many processes, we must add some others which are specific 
to the sample with one (and only one) 6-tagged jet. (Note the event samples used in our single-top analysis above 
include events with one or more tagged jets; however, the problems detailed in this section are somewhat less severe 
for samples with more than one tag.) 

A striking feature of this sample is that only a moderate fraction of Wj^ events with a single tagged jet (and 
passing our cuts) actually have a short-distance 6-quark parton present in the hard-scattering process; those that do 
are mainly Wbb. Instead, the single &-tag sample is composed of many different contributions. This situation is 
made explicit in Table IVlTll which shows the cross-section for various Wjj channels, before and after the requirement 
of a single tag, and Table Hxl which lists the relative contributions from the various Wjj channels to the single tagged 
sample. As with all our simulations, these contributions were calculated using Madgraph to evaluate the parton 
cross sections, Pythia to provide showering and hadronization, and PGS as the detector simulation of jet 
identification and tagging. The total VFj" cross-section (before tagging) was normalized to the one-loop result |18l |. 
The basic cuts of Table|2were used; note that an untagged jet was also required in the event. The labels Wxy in the 
first column indicate the perturbative final state at tree-level (as evaluated in Madgraph); here q represents a light 
quark or antiquark (w, d, s, u, d, s). In Table IVIIII the cross-section for each channel, before and after the requirement 
of a single tagged jet, is shown; also shown is the ratio of after-tagging to before-tagging {i.e., the fraction of each 
channel containing a single tagged jet.) In Table Hxl the three central columns divide the tagged events by whether 
the jet that was tagged in the event contained a bottom hadron, a charm hadron, or no heavy flavor. For example, 
the entry in the 6-jet column of the Wqg row indicates that 11 percent of the entire Wj" single-tag sample arises 
from Wqg events {q = m, d, s, u, d, s) in which the tagged jet contains bottom hadrons, mainly due to the splitting 
g bb in the parton showering. In determining the entries in Table Hxl from those in Table IVIIII we have attempted 
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Wjj Channel 


6- jet 


c-jet 


non-6/c-jet 


Total 


Wqq 


2% 


1% 


6% 


9% 


Wnn 
vv qy 


11% 


8% 


14% 


33% 


Wqq 


7% 


5% 


5% 


17% 


Wcq 


0% 


14% 


1% 


15% 


Wcg 


1% 


10% 


0% 


11% 


Wcc 


0% 


5% 


0% 


5% 


Wbb 


10% 


0% 


0% 


10% 


Total 


31% 


43% 


26% 


100% 



TABLE IX: A budget of the sample of Wj" events with a single tagged jet (and containing at least one untagged jet), constructed 
as in the previous table. Each entry shows the fraction of the sample containing a single-tagged jet that was generated from 
the tree-level process labelling the row (as in Table IVTTTl and in which the tagged jet was of the class labelling the column. The 
Wcq and Wcg entries include both c and c quarks; the symbol q stands for u, d, s, u, d, s. The entries in this table are subject 
to an additive uncertainty of ±1 — 2%. The (much larger) systematic uncertainties are discussed in the text. 

to correct for any double counting. For example, we have subtracted the contribution to the 6-jet Wgg entry (from 
g bb in the parton shower) arising from events with kinematic configurations already present in the 6-jet Wbb entry. 
This leads to small corrections, of order the stated ±2% uncertainty, in several of the entries. 

The central feature of these tables is the multitude of contributions to the tagged sample, all of similar magnitude. 
As clearly visible in Tablc lVlTll the large parton-level cross-sections for the light quark and gluon processes are reduced 
by a low tagging fraction, while the much smaller tree-level heavy-quark cross-sections are subject to much larger 
tagging rates. Consequently, within the single-tag sample, all such partonic processes end up contributing at roughly 
at the same level, as is clear in the "after-tagging" column of Table IVIIII and in the "total" column of Table IIXI 
The breakdown of the resulting single-tag sample by the fraction of the sample for which the tagged jet is a bottom 
jet, a charm jet, or a jet without heavy flavor is visible in Table Hxl as the numbers in the bottom row indicate, all 
contributions are again of the same order. 

The simulations were performed with statistics sufficient to ensure such that each entry in Table Hxl is subject to 
an additive statistical uncertainty of order 2-3%. However, the systematic uncertainties in the simulations are much 
larger, due to a host of important physical and technical issues. Since these systematic uncertainties are the most 
important obstacle to an accurate background estimate, we now discuss them in detail. 

Consider first the uncertainties in the basic event simulation. A precision simulation of the Wjj sample cannot, at 
present, be carried out. The parton-level theoretical computation of the differential cross-section for W plus two or 
more high px jets has been advanced in recent years: Wj i ha s been calculated to next-to-leading order, and Wjjj is 
known at leading-order. While the program "MCFM" Uim can provide accurate next-to-leading-order parton-level 
cross-sections, no event generator valid at next-to-leading order currently exists. Consequently, there is at present no 
possibility of simulating this background without significant theoretical uncertainties. 

Moreover, even when a next-to-leading-order event generator becomes available, there are serious questions con- 
cerning showering algorithms that must be addressed. The main problem is associated with the splitting of gluons to 
heavy quarks (or more precisely, with tuning Pythia or Herwig to simulate correctly the process in which a partonic 
gluon generates one or more jets containing charm or bottom mesons.) As illustrated in Table Hxl a substantial frac- 
tion of the single-tagged sample, of order 33% in our simulations, originates from this process. It is not known with 
confidence how often gluons at short distance lead to jets with charm or bottom hadrons, or how often this process 
leads to two jets rather than one. Studies on this issue that compare data from LEP and Tevtron [2^ with QCD 
expectations |2^ and with Pythia and Herwig suggest that there is no serious disagreement. However, this conclusion 
rests on the substantial uncertainties (of order 30%) in both the experimental and the theoretical results. Generally 
the perturbative predictions (including resummed logarithms) and the Monte Carlo results (with default parameters) 
tend to underestimate the observed rates of heavy flavor production in parton showers. Also, the rough agreement 
speaks mainly to overall rates of heavy-flavor production, not to charm-to-bottom ratios or kinematic distributions. 
Given the large sensitivity of the single-tag sample to these effects, it appears that a sizeable uncertainty in both the 
normalization and shape of the Wj"' background arises from this source. 

A related issue is the role of uncertainties in parton distribution functions. The single tag sample receives significant 
contributions from events with charm in the final state that arise from initial states containing non- valence partons. 
Examples include processes such as us W'^uc^ gd W^gc, and Ud — s- W^uc. These processes have varying shapes 



22 



and rates, and depend on the poorly-determined parton distribution functions. 

Even if one could precisely simulate these events, there is still the issue of determining the efficiencies for tagging 
of jets with bottom or charm quarks, and for mistagging of jets with neither. Thus must be done as a function of pt 
and pseudo-rapidity. As indicated in Table Hxl each of these tagging processes plays a comparable role in determining 
the Wjj sample. Uncertainties in tagging functions thus lead to uncertainties in the shape of Wj"" which cannot 
be ignored. An especially serious issue is that the ratio of c to & tagging rates is currently extracted not from data 
but from Monte Carlo programs, which, among other problems, are dependent upon the correct modelling of gluons 
splitting to heavy flavor. 

Clearly, it would be beneficial to decrease the mistagging and charm-tagging efficiencies relative to the efficiency 
for tagging of bottom jets. This would improve the ratio of the signal to the Wj^ background, reducing sensitivity 
to systematic errors, and also would directly suppress some of the main sources of uncertainty in the prediction of 
Wj". However, tuning the 6-tagging algorithm to improve purity of the sample generally comes at the cost of a small 
reduction in the 6-tagging efficiency, which in turn decreases the signal and increases statistical errors. The right 
balance between these competing issues is detector- and luminosity-dependent and must be left to the experimental 
collaborations. 

Another related concern is the subtle linkage between ^-tagging, jet definitions, and gluon splitting. When a gluon 
splits to two heavy quarks, the probabilities to obtain one tagged jet, two tagged jets, or one tagged and one untagged 
jet depend upon all three issues. This may mean that the separation of the samples with one tag versus two tags 
is unstable and not well predicted by theory. It is for this reason that we have chosen to consider samples with 
one-or-more tagged jets. Alternatively, one might require strong angular separation between tagged jets in order to 
retain predictivity. 

A further issue involves the potentially large sensitivity of the shape and normalization of the multi-channel Wj" 
background to the cuts used to bring backgrounds under control. In particular, both the Ht and rrit cuts used in 
the current analysis and in recent experimental papers reshape the Wj" background. While it has been shown 
that jet vetoes are effective at substantially reducing the size of tt backgrounds, we would argue against the use 
of this approach. The systematic errors which jet vetoes introduce into the prediction of the Wj" background has 
not been quantified, but we expect it is prohibitively large. We are especially concerned about the requirement of 
two-and-only-two jets employed in We believe this will make the prediction of Wj" unreliable, both because of 
problems with QCD corrections, and because of failures to correctly model the rate at which gluons in parton-level 
processes lead to zero, one or two jets containing heavy flavor. There is at present no consensus as to the safest method 
for reducing ti, or, for that matter, Wj". We would like to argue strongly that this is a very important problem, 
which our methods simultaneously mitigate and highlight. On the one hand, neither the F+ nor the F- distribution 
is strongly sensitive to the overall size of tt. Consequently, the use of a strong jet veto or harsh Ht cut is unnecessary, 
and indeed unjustified to the extent systematic errors in Wj" become larger as a result. Our approach would instead 
prefer a method which cuts tt less severely, and in a safer fashion, such that theoretical errors in predicting Wj" 
remain small. The Ht cut that we use is, we believe, safer, being a cut on a more inclusive variable; whereas the 
splitting of one jet into two, due to a fiuctuation or a changed jet algorithm, affects a jet veto in a dangerous way, 
this is not so for an Ht cut. But this safety is only relative; there are problems with jets moving above or below the 
minimum pT required for a jet to be included in the variable Ht that we defined. In any case, our method requires 
less focus on reducing the size of tt and more focus on keeping Wj" as small and as predictable as possible. 

With all these problems in view, any method used to extract the Wj" background will have to pass many cross- 
checks. One important consistency check can be carried out by calibrating Monte Carlo simulations using both Wj" 
and the process Z-plus-jets (".Zj""), where the Z decays leptonically. The processes Wj" and Zj" do not have the 
same shapes, rates, and heavy flavor content, so one cannot directly take ratios of distributions or even of overall 
cross-sections. However, the overall kinematics of Zj" is similar to Wj", and is similarly sensitive to all of the above- 
mentioned issues. For these reasons, we expect that matching a Monte Carlo to the rates and shapes of the Zj" 
and Wj" distributions from data, with zero, one or two tagged jets, will significantly reduce systematic uncertainties. 
Such matching will require adjusting parameters which affect the splitting of gluons to heavy flavor, and adjusting 
tagging functions. ^'^ It would also be very helpful if direct measurements of the bottom-content, charm-content, and 
non-fe/non-c content of the various single- and double-tagged Wj" and Zj" samples could be carried out, even with 
low precision and confidence. This would allow direct tests of numerous Monte Carlo predictions that at present have 
very large uncertainties. Studies of the Zb to Zj ratio have been carried out |23|, but the challenging study of Zc 
separately should also be considered as statistics improve. 

In summary, determining the single-tagged Wj" cross-section will require a carefully crafted combination of theory. 



We believe there should be enough Z-plus-jets data, with 3 fb ^ of integrated luminosity, for this analysis to be carried out. 
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data, and theory-optimized Monte Carlo. 

V. SUMMARY 

A counting experiment for discovery of electroweak single-top production appears very challenging. In an effort to 
improve the situation, we have explored the possibility of using the distinctive shape of this process to separate it from 
background. We use as observables the pseudo-rapidity 77^ of the leading-pT untagged jet and the pseudo-rapidity77£ 
of the charged lepton, weighted by the charge of the lepton Q^. (One of these variables was already used in 
Considering the distributions of signal and background in the (77^ ,77^) plane (where fjj = Qirjj and fji = Qirj^), we 
have noted that the distributions for tt and for QCD are largely symmetric, while that of the signal is not; the Wj^ 
(VF-plus-jets) background is intermediate between them. Constructing functions F, F±, defined in Eqs. (PJ-©, which 
have various symmetry properties, we have shown that the statistical and systematic errors in the functions F±, which 
are orthogonal to the function F that would be used in a counting experiment, can be brought close to reasonable 
size without using extreme cuts. 

Here is a summary of key ingredients that went into this analysis, as well as a list of elements which we did not 
account for, and a few of our crucial assumptions: 

• All cross-sections (t6, thq, tt, Wbb, Wcc, Wcq, Wcg, Wqq, Wqg, Wgg) were calculated at tree level. 

• These were then normalized to theoretical calculations at next-to-leading order. For Wj", only the sum of all 
channels was normalized in this way. 

• All processes were run through Pythia, to simulate showering, and through the detector simulation PGS. 

• We imposed hard pt cuts on the leading tagged and untagged jets, and required that a top quark be recon- 
structable from the lepton, tagged jet, and missing energy. We applied an Ht cut aimed at reducing tt] we did 
not use a jet veto. 

• pr-dependence of tagging fractions was accounted for, with the maximal tagging rates at high pT for b,c,q/g 
being taken as 50%, 15%, 1%. The details of the analysis are sensitive to these numbers, as well as to the rate 
for hard gluons at leading order to evolve into jets containing heavy flavor. 

• We did not attempt to simulate QCD events. Instead, we argued QCD effects are (in theory) sufficiently 
symmetric in shape and (from DZero data) sufficiently small in rate that their contributions to all useful 
observables can be neglected (with the possible exception, for DZero, of F^ in the muon channel.) 

• We discussed important shape effects on tt at next-to-leading order , which we expect to be about ten percent 
or less, but did not simulate them. 

It should be noted that we have not optimized our cuts to improve efficiencies and reduce systematics in a rigorous 
way. Indeed, it would not be too useful to do so, since the optimization is a moving target, depending on integrated 
luminosity, on tagging rates and other detector details, on next-to-leading-order shapes, and on Monte Carlo assump- 
tions. We believe, therefore, that our results could be improved upon through such an optimization, though this is 
only worth doing in a concrete analysis. We also have not explored whether other methods of reconstructing rrit might 
be more effective, or whether variables other than Ht might be better as far as both statistics and systematics. This 
is certainly something that should be done as the integrated luminosity increases. 

Moreover, there is a natural extension of our method which we did not consider, but which should be explored 
if the integrated luminosity becomes sufficiently high. Our observable F+ focussed on lepton-jet psuedo-rapidity 
correlations, but as we pointed out, these correlations can have two sources: inherent correlations in the rest frame 
of the tbq system (or of the top quark itself) and correlations which are induced by the boosting of these frames into 
the lab frame. Both of these effects are present in the signal, and they add coherently to give a large contribution to 
F^ . One could imagine measuring the two effects separately. This could potentially allow for even greater separation 
of signal from background. 

We conclude with a summary of and comments upon what we see as the main lessons of our analysis. 

1) Our method largely removes tt and QCD events from the observables F±, making extreme cuts on tt unnecessary, 
and focussing attention on Wj"^ as the main background. While the statistical fluctuations from tt are still important, 
they are of less concern than systematic errors on Wj", since the former scale as the square- root of the tt rate, while 
the latter scale linearly with the Wj"^ rate. Moreover, the ti background is much more safely calculated and simulated. 
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and will be, in the end, easier to remove. In our approach to single top, one should not cut hard on tt if doing so 
causes the systematic uncertainties in Wj^ to increase substantially. In particular, this argues against the use of a 
severe jet veto, in which events with more than two observed jets are discarded. 

2) The method that we have introduced requires that the shapes of ti and Wj"" be properly modeled, but it also 
provides for cross-checks. The function F measures the total background, and, assuming tt can be determined using 
other samples, this allows a measurement of the total VFj" background. Meanwhile, there is a region in the (77^,%) 
plane where the -F_ function gets small contributions from both signal and from tt. This region allows a check of 
whether the shape of Wj" has been correctly understood, as well as being interpretable as a worthwhile test of the 
standard model itself. 

3) The thorniest problem obstructing the measurement of single top is understanding the shape of the Wj" back- 
ground. This is a large and irreducible background which must be subtracted from the signal, even in the context 
of the methods we proposed here. This subtraction could be done directly, using our cuts, but this requires some 
prediction of the shape in the plane. Alternatively, a sideband analysis around the "rat" window cut could be 
applied to F±^ in appropriate pseudo-rapidity windows, but this too requires prediction of the effect of cuts on the 
distribution of Wj"" in "wt" and pseudo-rapidity. 

While theory, Monte Carlo and data all can, and must, assist with these subtractions, many different types of 
uncertainties plague the sample with a single 6-tag (and therefore the sample with one-or-more 5-tags), making it 
unclear how to bring all the available resources together. We believe that a dedicated study, examining the rates, 
shapes, and flavor content (especially of bottom versus charm) of both Wj"" and ^j", with zero, one and two tagged 
jets, will be necessary. This will require a blend of multiple measurements, theoretically precise predictions, and 
careful tuning and cross-checking of Monte Carlo simulations. Since this issue affects many other measurements, 
including the Higgs search and numerous beyond-the-standard-model searches, we view this as a very high priority. 

4) There are very few paths toward reducing the Wj"^ background relative to the signal. One clear need is to decrease 
the mistagging rate and charm-tagging efficiency while maintaining or increasing the 6-tagging efficiency; this would 
both improve signal-to-background and reduce some of the uncertainties that make it difficult to model the background. 
Another important step would be taken if the resolution in reconstructing the top quark mass from the 6, lepton and 
missing energy could be improved. This would allow a narrowing of the mj window cut, which would both reduce the 
size of the Wf'' background and reduce experimental and theoretical uncertainties in any sideband analysis. Beyond 
this, one would need to consider more radical ideas, such as finding methods which could, on average, differentiate 
bottom jets from charm jets, bottom jets from antibottom jets, and/or jets formed by short-distance heavy quarks 
from jets formed by gluons that have split into roughly-collinear heavy quark pairs. 

In conclusion, our method for extracting single-top confirms that one can use the distinctive shape of the signal to 
reduce backgrounds more effectively than in a pure counting experiment. However, we also find that backgrounds are 
much worse than was once thought. Improvements in (mis)tagging rates and in the understanding thereof, careful 
modeling of Il^-plus-jets cross-checked against both theory and data, and more theoretically trustworthy techniques 
for cutting away backgrounds will all be necessary for a robust measurement of single top production. The single-tag 
H^-plus-jets background, in particular, represents a challenge that the whole community must meet head-on. 
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